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XML GENERATOR FOR OBJECTS WITH BINARY BRANCHING TOPOLOGY 
FIELD OF THE INVENTION 

[0001] The field of invention relates generally to pattern extraction from images 
where the extracted pattern is later used for data storage and modeling and, more 
5 specifically but not exclusively relates to techniques for defining the topology of 
images using XML. 



BACKGROUND INFORMATION 

[0002] The extraction of the topology of shapes is a basic building block for 
10 modeling physical processes. Many natural topologies (neural anatomy and the 
botanical structure of plants) can be described by a binary branching tree structure 
(Verwer R W H, J Van Pelt, 1983, "A new method for the topological Analysis of 
Neuronal Tree Structures." J Neurosci Meth, 8, 335-351). Many organic processes 
such as the electrophysiological processes of the neuron are influenced by their 
15 morphological properties (Ascoli A. Giorgio, Progress and Perspectives in 
Computational Neuroanatomy, Anatomical Record (The New Anatomist section), 
257(6): 195-207 (1999)). 

[0003] The current state of the art models these objects as a connected series of 
elements. The geometry of the object is described by listing the coordinates of the 
20 elements along with a reference to the elements parent or child. This referencing 
system allows the topology of the object to be maintained. A drawback of this 
method is that adjacent elements in the object may not be adjacent in the file 
structure. This means a referencing system is required to navigate the elements in a 
manner dictated by the object being modeled. 

25 
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SUMMARY OF THE INVENTION 
[0004] In accordance with aspects of the present invention, techniques are 
disclosed for describing topology present in an image bitmap of an object by tracing 
the bitmap while simultaneously generating an XML file with nesting structure based 
5 on the binary branching topology of the bitmap. In one embodiment, the topology of 
the object is represented as a skeleton bitmap. As the skeleton is traced, grouping 
and line elements are added to an XML file when either 0 or 2 neighboring pixels are 
encountered. Attributes are added to the grouping and line elements in such a way 
as to retain the geometry of the skeleton. In one embodiment the element and 
10 attribute names are specified such that the generated XML files can be rendered in a 
scalar vector graphics (SVG) viewer. The generated XML file may be used in further 
processing to store attributes representing physical properties of the structure 
without requiring a location referencing system. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
[0005] The foregoing aspects and many of the attendant advantages of this 
invention will become more readily appreciated as the same becomes better 
understood by reference to the following detailed description, when taken in 
5 conjunction with the accompanying drawings, wherein like reference numerals refer 
to like parts throughout the various views unless otherwise specified: 
[0006] Figure 1 illustrates an exemplary computing device that may be used in 
one exemplary embodiment of the present invention; 

[0007] Figure 2 is a logical flow diagram of an exemplary process for generating 
10 an XML file in accordance with the present invention; 

[0008] Figure 3 is a logical flow diagram of an exemplary process for tracing a 
skeleton in accordance with the present invention; 

[0009] Figure 4 is a logical flow diagram of an exemplary process for adding line 
elements to the XML file in accordance with the present invention; 
15 [0010] Figure 5a shows a representative explanatory figure for a skeleton; 

[0011] Figure 5b shows an exemplary implementation of the restriction on node 
geometry; 

[0012] Figure 6a shows a representative XML file in accordance with the present 
invention; 

20 [0013] Figure 6b shows a representative SVG image in accordance with the 
present invention; 

[0014] Figure 7a shows a representative XML file listing showing the application 
of the present invention for data storage; 

[0015] Figure 7b shows a representative SVG image in accordance with the 
25 present invention for data storage. 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
[0016] Embodiments of methods for generating XML-based documents 
describing the topology of objects are described herein. In the following description, 
numerous specific details are set forth to provide a thorough understanding of 
5 embodiments of the invention. One skilled in the relevant art will recognize, 
however, that the invention can be practiced without one or more of the specific 
details, or with other methods, components, materials, etc. In other instances, well- 
known structures, materials, or operations are not shown or described in detail to 
avoid obscuring aspects of the invention. 

10 [0017] Reference throughout this specification to "one embodiment" or "an 
embodiment" means that a particular feature, structure, or characteristic described in 
connection with the embodiment is included in at least one embodiment of the 
present invention. Thus, the appearances of the phrases "in one embodiment" or "in 
an embodiment" in various places throughout this specification are not necessarily 

15 all referring to the same embodiment. Furthermore, the particular features, 
structures, or characteristics may be combined in any suitable manner in one or 
more embodiments. 

[0018] Throughout this specification and in the claims, several terms of art are 
used. These terms are to take on their ordinary meaning in the art from which they 
20 come, unless specifically defined herein (such as follows) or the context of their use 
would clearly suggest otherwise. 

[0019] Extensible markup language (XML) is specified by the W3C and can be 
found at ( http://www.w3.orq/XML/ ). The term element refers to the basic unit of an 
XML document. The element may contain attributes, other elements, content, and 
25 other building blocks for an XML document. 

[0020] The term 'tag' refers to a command inserted in a document that delineates 
elements within XML documents. Each element usually has no more than two tags: 
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the start tag and the end tag. It is possible to have an empty element (with no 
content) in which case one tag is allowed. 

[0021] The content between the tags is considered the element's 'children 1 (or 
descendents). Hence, other elements embedded in the element's content are called 
5 'child elements' or 'child nodes' of the element. Text embedded directly in the 
content of the element is considered the elements 'child text nodes'. Together, the 
child elements and the text within an element constitute that element's content. 
[0022] The term 'attribute' refers to an additional property set to a particular value 
and associated with an element. Elements may have an arbitrary number of 
10 attribute settings associated with them, including none. Attributes are used to 
associate additional information with an element that is not included in the elements 
content. 

[0023] Scalar vector graphics, SVG, is a language for describing two-dimensional 
vector and mixed vector/raster graphics in XML. The specification can be found at 

15 http://www.w3.orq/tr/svq/ ). 

[0024] Embodiments of the present invention take advantage of the grouping 
properties of SVG, namely the 'g' (or grouping) element. The 'g' element is a 
container element for grouping together related graphics elements. A 'g' element 
can contain other 'g' elements within it to an arbitrary depth. The 'transform' 

20 attribute transforms user space coordinates and lengths on sibling attributes on the 
given element and all of it descendents. The line elements are a graphics element 
that is defined by some combination of straight lines and curves. The embodiments 
described herein take advantage of these SVG tags to reproduce the geometry of 
the object. 

25 [0025] Xpath is a language for addressing parts of an XML document and has 
been endorsed by the W3C; the specification can be found at 
http://www.w3.orq/tr/xpath . Xpath models an XML document as a tree of nodes. 



Attorney Docket: 7016.P001 Express Mail No. EV320120133US 

[0026] With reference to Figure 1, one exemplary system for implementing 
embodiments of the invention includes a computing device, such as computing 
device 110. Computing device 110 typically includes at least one processing unit 
112 and system memory 113. System memory 113 may comprise volatile memory 
5 (such as RAM), non-volatile memory (such as ROM), or some combination of the 
two. The system memory 113 typically is used to store runtime instances of an 
operating system 115, one or more applications 117, and one or more data files 119. 
In one embodiment application 117 includes an XML generator 300, a skeleton 
bitmap file 119, and an XML document 120. Although illustrated here in the context 
10 of an XML generator 300, embodiments of the present invention has equal 
applicability to other types of applications, as will become apparent to those skilled in 
the art. 

[0027] In addition to the foregoing components, device 110 may have additional 
features or functionality. For example, computing device 110 may also have input 

15 device(s) 130, such as a mouse, keyboard, touch input device, etc. Output device(s) 
131, such as display, printers, etc. may also be included. Computing device 110 
may also have additional data storage devices, such as removable storage device, 
133, and non-removable storage device, 132. System memory 113, removable 
storage 133, and non-removable storage 132 are examples of computer storage 

20 media. Computer storage media may include removable and non-removable, 
volatile and non-volatile media implemented in any method or technology for storage 
of information, such as data structures, computer readable instructions, program 
modules, or other data. Computer storage media includes, but is not limited to, RAM, 
ROM, EEPROM, CD-ROM, flash memory or other memory technology, digital 

25 versatile disks (DVD) or other optical or magnetic storage devices. 

[0028] Computing device 110 may also include communication connection 134 
that allows the device to communicate with other computing devices 135. 
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Communication connection 134 is one example of communication media. Data 
structures, computer readable instructions, program modules or other data in a 
modulated data signal may typically embody communication media. The term 
computer readable media as used herein includes both storage media and 
5 communication media. 

[0029] Figure 2 is a logical flow diagram of an exemplary process for generating 
an XML document in accordance with one embodiment of the invention. The 
exemplary process 200 includes loading a bitmap image 201, generating a binary 
image 202, generating a skeleton 203, generating the XML document 300, and then 

10 storing data in the XML document 205. 

[0030] Step 201 involves loading a bitmap image of the object being modeled. In 
general, a bitmap consists of an array of pixels where each pixel is specified by a 
corresponding color. In one embodiment, a red, green, blue (RGB) value is 
specified for each pixel. 

15 [0031] Processing continues at step 202, which generates a binary bitmap. The 
object to be modeled can usually be separated from the background using standard 
image processing techniques (e.g., see Russ, John, The Image Processing 
Handbook, 2002). Other filtering and processing techniques are applied to separate 
the object from the background. The processing of the image into a suitable binary 

20 image is well established and will not be discussed herein. 

[0032] Step 203 generates a skeleton bitmap representing the topology of the 
object being modeled. The term 'skeleton' refers to a binary bitmap (the pixels are 
either white or black). In one embodiment, all foreground pixels can have three 
neighboring foreground pixels at most. Those foreground pixels that have three 

25 neighbors are considered nodes, those with only one neighbor are considered 
endpoints, and those with two neighbors are considered part of line elements. 
Figure 5a shows an exemplary skeleton with the starting pixel specified. In one 
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embodiment, a further restriction on the node geometry is applied such that pixels 
adjacent to nodes cannot be touching. An exemplary implementation of this 
restriction is shown in Figure 5b. All lines in the skeleton have parent lines except 
the root, or first, line, which has no parents. A child line is defined as the line 
5 element with a start point located at a terminal point of a parent line. Each parent 
line has two child lines. In one embodiment, another restriction on the skeleton is 
that it has no looping morphology. The skeleton can be generated using well-known 
morphological thinning techniques discussed in the literature, such as disclosed in 
US patent 5,574,803, incorporated herein in its entirety by reference. This is well 
10 documented in the literature and will not be discussed further herein; (e.g., see 
http://www.dai.ed.ac.uk/HIPR2/skeleton.htm , Russ, John, The Image Processing 
Handbook, 2002). 

[0033] Figure 3 is a logical flow diagram 300 of an exemplary process for 
generating an XML document in accordance with one embodiment of the invention. 

15 The process begins at Step 301 , which loads a skeleton representing the topology of 
the object being modeled. The process continues to step 302 where a root element 
is created in an empty XML document. In one embodiment the root element name is 
'svg'. Processing continues at steps 303 and 304, where the branch subroutine 305 
is initialized with the coordinates of the starting pixel located on an endpoint of one 

20 of the branches in the skeleton. The node location within the XML document must 
be maintained while tracing the skeleton. XPath models an XML document as a tree 
of nodes. In one embodiment an XPath string is used to specify this location. 
[0034] In further detail, step 305 recursively calls itself with the XPath string, the 
coordinates of the parent line, the starting point of the current line and the current 

25 point being traced on the skeleton. Step 305 examines the eight pixels adjacent to 
the current pixel, four diagonal and four on the side, to determine if any are part of 
the skeleton. RGB values are used to differentiate the skeleton from the 
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background. In one embodiment, the skeleton is defined as being composed of 
black pixels where the RGB value is (0,0,0) and the background is set to white 
(255,255,255). If there is only one neighboring pixel then processing continues to 
step 306. Step 306 sets the selected pixel to the background color to avoid retracing 
5 the skeleton. It then calls step 305 again with the coordinates of the selected pixel. 
[0035] If zero neighboring pixels are found in step 305, then processing continues 
to step 400. In one embodiment, step 400 adds a grouping and line element to the 
XML file using the coordinates of the parent and child lines (block 400). Processing 
then continues to step 308 where the subroutine exits. 

10 [0036] If two neighboring pixels are found in step 305, then processing continues 
to step 400. In one embodiment step 400 adds a grouping and line element to the 
XML file using the coordinates of the parent and child lines and Xpath string. 
Processing continues to step 309 where the selected pixels are set to background 
color. Processing continues to step 310, where step 305 is called twice using the 

15 coordinates of the two selected points. Step 310 updates the parent line 
coordinates. Step 310 also appends the first points XPath string with 7g[1]' and 
appends the second points Xpath string with 7g[2]'. Figure 6a shows the XML file 
listing generated from this process using the exemplary skeleton 5a. Figure 6b is 
shows the result of rendering the XML code of Figure 6a with an SVG viewer. 

20 [0037] Figure 4 is a logical flow diagram 400 of an exemplary process for 
generating an XML document in accordance with one embodiment of the invention. 
The process starts at step 401, which adds a grouping element to the XML 
document at a location specified by the XPath string. In one embodiment the 
grouping elements name is 'g'. Processing continues to step 402, where the angle 

25 the parent line would be rotated to align with the child line is calculated. Processing 
continues at step 403 where attributes are added to the grouping element that 
specify the transformation which will be applied to the line element added at step 
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404. In one embodiment the attribute name is 'transformation' with property names 
'translation' and 'rotation'. In SVG, the translation property is defined by an Y and 
y value and the rotation property is defined by an angle. In one embodiment, the 'x' 
value is set to '0' and the y value is set to the length of the parent line. Processing 
5 continues at step 404, where a line element is added as a child of the grouping 
element. In one embodiment the line element name is line'. Processing continues 
to Step 405, which adds a length attribute to the line element. In one embodiment 
the name of the attribute is 'y2' and the value of the attribute is length of the line. 
[0038] Processing continues at block 205 in Figure 2. Block 205 demonstrates 

10 how attributes can be added to the line elements of the XML document. Figure 7a 
shows an exemplary file-listing showing the attribute 'a' has been added to the 8.2 
and 8.5 length lines. Attribute 'a' has been set to '3' for the 8.2 length line and to '2' 
for the 8.5 lines. Figure 7b shows figure 7a rendered in an SVG viewer. Note how 
the width of the lines has been modified to reflect the attribute values. 

15 [0039] It is contemplated that embodiments of the invention may be configured as 
a software-based XML generator. Thus, embodiments of this invention may be used 
as or to support a software program executed upon some form of processing core 
(such as the CPU of a computer) or otherwise implemented or realized upon or 
within a machine-readable medium. A machine-readable medium includes any 

20 mechanism for storing or transmitting information in a form readable by a machine 
(e.g., a computer). For example, a machine-readable medium can include such as a 
read only memory (ROM); a random access memory (RAM); a magnetic disk 
storage media; an optical storage media; and a flash memory device, etc. In 
addition, a machine-readable medium can include propagated signals such as 

25 electrical, optical, acoustical or other form of propagated signals (e.g., carrier waves, 
infrared signals, digital signals, etc.). 
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[0040] The techniques described above provide several advantages over the 
prior art. One advantage is that the data structure (i.e., the XML document) is 
generated to match the topology of the object being modeled. This removes the 
need for a referencing system when navigating the structure in a connected fashion. 
5 The logic of the object is the structure of the data itself. This simplifies the modeling 
of a process where the topology of the structure determines local interactions. For 
example, the electrophysiological processes of neurons are modeled as a series of 
compartments, where the state of a compartment at the next time step is determined 
by the state of its neighboring compartments (see, e.g., Schilstra, M, "A finite state 

10 automaton model for multi-neuron simulations," Neurocomputing 44-46 (2002) 1 141- 
1148). Furthermore, this simplifies the modeling process because selection of the 
neighboring elements is achieved by selecting neighboring nodes in the XML file. 
Another advantage is that once the topology of the object is expressed in XML, 
attributes can be added to the line elements to store physical characteristics of the 

15 object. Attributes saved in the line elements can be easily selected using standard 
XML navigation tools. In addition to duplicating the nesting structure, some 
embodiments may also retain the geometry of the object if appropriated SVG tags 
are used to generate the XML file. Also, when the XML file is generated using SVG 
tags it can be rendered in a web browser using a SVG viewer. Such an SVG viewer 

20 can be found at http://www.adobe.com/SVG/main.html 

[0041] The above description of illustrated embodiments of the invention, 
including what is described in the Abstract, is not intended to be exhaustive or to 
limit the invention to the precise forms disclosed. While specific embodiments of, 
and examples for, the invention are described herein for illustrative purposes, 

25 various equivalent modifications are possible within the scope of the invention, as 
those skilled in the relevant art will recognize. 
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[0042] These modifications can be made to the invention in light of the above 
detailed description. The terms used in the following claims should not be construed 
to limit the invention to the specific embodiments disclosed in the specification and 
the claims. Rather, the scope of the invention is to be determined entirely by the 
5 following claims, which are to be construed in accordance with established doctrines 
of claim interpretation. 
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